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Abstract 

We consider the problem of estimating the average tour length of the asymmetric TSP arising 
from the disk scheduling problem with a linear seek function and a probability distribution on 
the location of I/O requests. The optimal disk scheduling algorithm of Andrews, Bender and 
Zhang is interpreted as a simple peeling process on points in a 2 dimensional space-time w.r.t the 
causal structure. The patience sorting algorithm for finding the longest increasing subsequence 
in a permutation can be given a similar interpretation. Using this interpretation we show that 
the optimal tour length is the length of the maximal curve with respect to a Lorentzian metric on 
the surface of the disk drive. This length can be computed explicitly in some interesting cases. 
When the probability distribution is assumed uniform we provide finer asymptotics for the tour 
length. The interpretation also provides a better understanding of patience sorting and allows us 
to extend a result of Aldous and Diaconis on pile sizes 

1 introduction 

Modern disk drives have the ability to queue incoming read and write requests and to service them 
in an out of order fashion. In the Batched disk scheduling problem we are given a batch of n 
queued requests and we wish to service them in an order which minimizes the total service time, 
or equivalently, in an order which minimizes the number of disk rotations required to service all n 
requests. 

Data locations on a disk may be specified by a radial coordinate r which measures distance from 

the inner radius of the disk and by an angular coordinate which measures the angle with respect to 

some fixed ray. The seek function f{9) is defined to be the radial distance which the disk head can 

travel, starting and finishing at rest in the radial direction, while the disk rotates through an angle 6. 

The mechanics of disk head motion dictate that the function / is convex. The choice of / characterize 

a model of the physical disk. For a pair of disk locations P, Q we let df(P, Q) be the time required 
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to move the disk head from location P to location Q, with no radial velocity at the beginning and 
end of the movement, df always satisfies the triangle inequality but fails to be symmetric. The 
disk scheduling problem is the traveling salesman problem on n disk locations with distance function 
df. In general this problem is NP-Complete, [2]. In this paper we assume that the seek function is 
linear, f(9) = c8. Since c determines the function / within the family of linear functions we will use 
c to denote the function instead of /. While linearity of the seek function is a simplistic assumption 
many insights can be gained from studying this case, some of which can be applied more generally. 
In the linear case Andrews, Bender and Zhang (ABZ) have found a polynomial time algorithm for 
computing the optimal tour. Our goal is to provide average case analysis for the length of the optimal 
tour and a better understanding of the ABZ algorithm. 

To that end we consider a density function p(6, r)d9dr defined on the locations of data on the 
disk which expresses the popularity of the data residing at location (r, 9). We let L n ^ p denote the 
random variable which assigns to n data locations Pi, P n , sampled in accordance with the density 
function p, the length of the optimal tour between them w.r.t the distance function d c . Given c and 
p the random variable L n ^ CtP concentrates w.h.p around it's mean value which asymptotically has 
the form m C)P y/n. We provide a geometrical description of m c , p in terms of space-time (Lorentzian) 
geometry. A space-time (A compact Lorentzian manifold with boundary) provides both a natural 
way of sampling points via the volume form and (at least locally) a natural partial order which 
in relativity theory is interpreted as the past-future causal relation, [8]. We thus obtain a natural 
random poset structure, the causal structure on n points sampled w.r.t the volume form, such 
objects are known as causal sets in the (quantum) gravity literature, see [15] for a survey. We show 
that the ABZ algorithm is intimately connected to a simple peeling process applied to causal sets 
arising from the Lorentzian metric ds 2 = j;p(6,r)(dr 2 — c 2 d9 2 ) on a disk drive. Another algorithm 
(process) which can be interpreted in a similar fashion is patience sorting which computes the longest 
increasing subsequence (l.i.s) in a permutation. Patience sorting is a peeling process applied to causal 
sets associated with metrics of the form ds 2 = 4p(x, y)dxdy, where p is a probability distribution 
on the unit square in the plane. The fact that increasing subsequences can be viewed as chains 
in a causal set was noted in [6]. As a result of the above observations we show that the optimal 
tour length and the length of a l.i.s can be estimated by the diameters of their associated space- 
time models. For the l.i.s problem this is essentially a reformulation of a result of Deuschel and 
Zeitouni, [10]. When p depends only on r we obtain the explicit formula m c , p = Jq ^/p(r)dr. To 
simplify matters we first develop these computations without reference to space-time geometry. The 
Lorentzian geometry interpretation also leads to further information on the behavior of the ABZ 
and patience sorting algorithms. In particular we generalize a result of Aldous and Diaconis on the 
statistics of pile sizes in patience sorting. The framework provided by the paper can also be used 
to analyze other discrete processes such as airplane boarding, [4], and the polynuclear growth model 
(PNG) in statistical physics, [14]. 
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We note that the analysis of disk scheduling (along with airplane boarding) seems to be the first 
application of Lorentzian geometry outside the domains of physics. 

In the last section we take p to be uniform. We can then use some refined estimates on the size 
of the longest increasing subsequence in random permutations to obtain better bounds on the tour 
length. 

For the convenience of the reader we have also added an appendix containing a very brief intro- 
duction to Lorentzian geometry and it's relativistic interpretation. 

Some of the results of the present paper are based on the first part of [3]. The Lorentzian 
interpretation and associated results are new to the present paper. [3] also contains some asymptotic 
results for more general convex seek functions. 

2 The disk scheduling problem 

In this section we describe the batched disk scheduling problem and briefly survey some results on 
increasing subsequences which are used later on. The batched disk scheduling problem was formally 
introduced in [2] section 2 which we follow closely in the next subsection. 

2.1 The batched disk scheduling problem 

A computer disk has the shape of an annulus, which geometrically can be thought of as a cylinder 
C. For convenience we normalize the radial distance between the inner and outer circles to be 1. 
Each point on the disk (cylinder) is represented by coordinates, R = (6,r) where < r < 1 is the 
radial distance from the inner circle and < 9 < 1 is the angle relative to an arbitrary but fixed 
ray. A complete circular angle is chosen to be of 1 unit instead of 2ir for the convenience of future 
computations. The points (0, r) and (l,r) are thus identified. 1 

The disk drive has a recording head which is used for reading/ writing data from/to the disk. 
The head is placed on an arm that can move radially in and out. In addition the disk rotates at a 
constant speed. The seek function f(9) represents the maximum radial distance that the disk head 
can travel starting and ending with no radial motion, while the disk rotates through an angle 6. Note 
that the angle 9 is not limited to be less than 1 since moving a large radial distance may take more 
than one rotation of the disk. We will assume that / is linear, f(9) = c9. 

Consider the infinite strip U given by < r < 1, — oo < i < oo. We consider the operator T on 

U which is defined by T(t,r) = (t+l,r). The cylinder C, representing the disk, is the quotient of 

U with respect to the action of T. The quotient map it : U — ► C is given by 7r(t, r) = [t mod 1, r). 

We also let I 2 C U denote the set of points in U with < t < 1 which is the unit square. I 2 is a 

fundamental domain for the action of T on U. We think of the t coordinate as representing time, in 

X A disk drive actually consists of several annuli (platters). However this fact will have little significance to what 
follows 
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units of complete disk revolutions. We normalize the time coordinate in such a way that the head 
of the disk is at angle at time 0. It follows that t Mod 1 is the angle of the disk head at time t. If 
the disk head at time t is in radial position r then the disk position of the head is ir(t, r) G C. 

Let / be a convex function. We define a partial order < j ^ or on U. We say that (t\, r\) <hor (h, r 2 ) 
iff t 2 > t\ and f(t 2 — t±) > \r 2 — r±\. It is easy to verify that <f : hor is indeed a partial ordering. If / 
is the seek function of a disk we may interpret this relation as follows, (t±,ri) <hor (t 2 ,r 2 ) iff a disk 
head which at time t± is in disk location 7r(ti,ri) can reach at time t 2 the disk location Tt(t 2 ,r 2 ). 

Given a set of I/O requests to data locations, {R±, R n } C C, we formally add a request 
Ro = (0, 0) and denote the resulting subset of C by R. We also consider the set R u = 7r _1 (^). 

A tour of R is a chain in the poset (R u , <hor) starting at (0,0) and ending at (0, k), for some 
k > 0, and containing one element from each set 7r~ 1 (Ri) for all i = 1, ...,n. A tour can be thought 
of as giving the order and times in which the requests Ri are to be serviced. The chain condition 
guarantees that the head of the disk can reach a request, from the location of the previous request, 
on time to provide service. We define the service time of the set of requests R, denoted ST(R) to 
be equal to the minimal integer k for which there a tour of R starting at (0, 0) and ending at (0, k). 
Stated otherwise, the service time is the minimal number of disk rotations needed to service the 
requests in R. 

Let g be a concave function. We define a partial order <g jVer on U. We say that (ti,r±) <g,ver 
(t 2 ,r 2 ) iff 5(1*2 - h\) < \r 2 - n|. 

From now on we restrict our discussion to linear functions / and omit / (or c) from subscripts. 
When considering a finite set S in U, C or I 2 we shall always assume that for any pair of points 
(t2,r 2 ) we have \f{\ti — t 2 \) ^ \r\ — r 2 \- By definition a set S C U is a chain with respect 
to <hor iff it is an independent set with respect to < ver and likewise S is an independent set with 
respect to <hor iff it is a chain with respect to < ver - We shall refer to this property by saying that 
< ver and <hor are complementary. 

We define < ver and < ver on the unit square I 2 by restriction from U. We can also lift the relation 
<ver to C by letting R± < ver R 2 , R±,R 2 £ C iff there exist Qi,Q 2 £ U such that n{Qi) = R% and 
Qi <Der £?2- We note that for Qi,Q 2 £ U, such that Q\ < ver Q 2 we have r(Qi) < r{Q 2 ). It is easy 
to verify using this fact that < ver defines a partial order on C. If Q\ < ve r Q2 <t>er ••• <ver Qn is a 
chain of < ver in U then by definition the ordered set Ri = ir{Qi) is a chain for < ver in C. Conversely, 
given a chain R\ < ver R 2 < V er ■■■ <ver Rn in C we can form a chain Q\ < ver ■■■ <ver Qn in U such 
that ir(Qi) = Ri. We can construct the lifted chain as follows. Assume that R\,...,Ri have been 
lifted to Qi, ...,Qi. By the definition of < ver in C there are Qi,Q' i+1 G U such that Q[ < ver Q' i+1 
and such that 7r(Q^) = Ri and Tr(Q' i+1 ) = R4+1. Since ir(Qi) = Tr(Q'i) we have an integer k such that 
Qi = T fe (<5'j)- Let Qi+i = T k (Qi + i)' then by T invariance of the relation < ver in U we obtain the 
desired extension of the lifted chain. 

Given a finite partially ordered set (S, >), there is a simple and well known procedure which 
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simultaneously finds a minimal decomposition of S into independent sets and a maximal chain in S. 
Consider the set S\ of minimal elements in S with respect to >, let Si be the set of minimal elements 
in S — U^Z^Sj. For each element of s G Si construct pointers to all elements s' G for which 
s > s'. Let Sk be the last nonempty set thus defined. We note that the sets Si must be independent 
sets. All maximal chains in S are obtained by following pointers from elements of Sk all the way 
back to S\. The length of the maximal chain is obviously k which must therefore be the size of a 
minimal decomposition into independent sets. We note that the process can be applied to infinite 
posets as long as the maximal length of a chain which ends at a given element s G S is bounded. 
The origin of this simple process may be traced back to G. Cantor's work on ordinal arithmetic. 

2.2 The optimal algorithm of Andrews, Bender and Zhang 

We consider an algorithm of M.Andrews, M. Bender and L.Zhang for finding the optimal tour (up to 
a bounded additive constant) for servicing a set of requests R, [2]. We rephrase their algorithm in 
terms of the poset peeling process which was introduced earlier. 

Let R C C be the set of requested data locations. Let W\, ...,W m be the sets obtained in the 
peeling process applied to the poset (R u , < ver )- Since the W{ are independent sets with respect to 
< V ert they form chains w.r.t <hor- the chain structure provides a linear order on Wj. The number of 
independent sets is by the properties of the peeling process equal to the size of of the maximal chain 
in RF w.r.t < ver . We denote the size of the maximal chain by Mu^ ver (R u ) . By the chain lifting 
construction of the preceding subsection this equals Mc, V er(R), the maximal chain in R as a subset 
of C w.r.t < ver . 

Since the map T is a partial order preserving isomorphism of U onto itself, the sets W, are 
invariant under T and hence have the form 7r _1 (S'j) for some Si C R. 

Given a chain of elements vi, ..,vi in a poset we will denote by pred(vi) the predecessor of v.- L in 
the chain, namely, the element Vi-\. 

The ABZ Algorithm: (see [2]) Start at the point (0,0) G U. proceed to the first point 
1171,1 G W\ for which w\ t \ >hor (0,0). Let W7i,i, H7i,2, ^IjXi) = pred(T(wi t i)), be the elements of 
R u in the chain W\ between w\^\ and T(wi j ±). Similarly, for general 1 < i < m, let u^i be the 
minimal element w G W, for which w >hor w i-i,j(i-i)- Let iWj ; i, Wij(i) = P r ^d{T{wi ) i)) be the 
elements of RF in the chain W{ between ti^i and T{w^\). Let k be the least integer for which 
(0, k) > hor 

w m,j(m)) ■ The output of the algorithm is the concatenated tour 
7r((0, 0)), Tr(w hl ), ...,ir(w 1J(1) ),ir(w 2 ,i), ^(w m>j(m) ), vr(0, k). 
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3 Estimating the optimal tour length 

3.1 A combinatorial estimate 

In order to analyze tour length for this algorithm it will be convenient to consider a slightly modified 
version of the ABZ algorithm. 

Consider first the piecewise linear curves L\ in U which join successive points of Wi by straight 
line segments. Since the Wj are chains and the seek function / is linear the curves L\ form continuous 
chains with respect to <hor- We also note that each element of Wj+i lies above the line L[ since by 
definition an elements of Wj+i £ Wj+i dominates an element of Wi £ Wi w.r.t < ver . If the line L\ 
would have passed above we would get a point l\ of L\ with the same t coordinate as Wi + \ which 
dominates Wi + \ and hence dominates Wi w.r.t < ver - By complementarity we conclude that there is a 
point l\ € L'i which is independent of u>i 6 Wi w.r.t <hor contradicting the chain condition. The L\ 
are also invariant under T since Wi is T invariant. Finally we note that the piecewise linear curves 
L\ can be considered as graphs of functions on t. Let L\ = L[ and define the curves Lj inductively 
to be the graph of the maximum of the functions L^(i) and Lj_i(t). Since by the argument above 
points of Wi lie above Lj_i and by definition lie on L\ they also lie on Lj. By definition the curves 
Li form chains w.r.t <hor an d at Every point Lj+i lies above or coincides with Lj. Lj are again T 
invariant by definition. 

Finally consider the line segment J defined by r = ct, passing from the point (0, 0) to the point 
(1/c, 1). By definition J forms a chain w.r.t <hor- Let £j ; i be the first meeting point of J with Lj. 
Since J begins at the point (0, 0) and Lj+i is above Lj we see that Zj+i,i is either further along J or 
equal to Zj^. We let (0,0) be Iq 

We can now state the modified algorithm 

Modified ABZ algorithm: 

Let k,i,k,2, ■■■,k,h(i) = T(li : i) be the points of R u on Lj between Z^i and T(/j i i), inclusive. The 
modified ABZ tour consists of the concatenation 

(O^),/!,!,...^^,!),^!),...,^- 1 ^!),...,^^!),..,^^!)^^^), where k is the smallest inte- 
ger such that T m (l mA ) < hor (fc,0). 

Theorem 3.1 M c , V er{R) - 1 - 1/c < ST(R) < M c , V er{R) + 1 + 2/c 

Proof : We consider the upper bound first. We have t{T m (l m) i)) = m + £(Z m ,i) ^ m + l/ c - The 
point q = {t(T m (l mA )) + cr(T m (l mA ),0) satisfies g > ftor T m (/ mjl ), hence 
fc = [t(T m (/ m>1 )) + (l/c)r(T m (l mA )} + 1 < m + 1/c + 1/c + 1 = m + 2/c + 1. 

We now consider the lower bound. Let Qi,Q2 €. U such that Q2 > V er Qi- Let Q2 ^ ^ be 
such that ^(Qg) = ^((^2) and Q 2 >^ or Qi- If i(Q2) > t(Qi) then by definition of the relation 
< ver we have (t,r(Q2)) > V er Qi for all t{Q\) <t< t{Q2). We conclude by complementarity that 
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t(Q' 2 ) > t(Q 2 ) + 1 = t(Qi) + (t(Q 2 ) - i(Qi)) + 1. If i(Q 2 ) < i(Qi) or Qi >„ er Q 2 then obviously 
the same inequality holds. Let R^, Ri m be a maximal chain w.r.t < ver among the requests R. 
Let Q±, ...,Qm £ U be a lifting w.r.t ir of the maximal chain to a maximal chain w.r.t < ver on U. 
Consider an optimal tour in U. Assume that the points Qi are indexed by the order in which the 
optimal tour visits their equivalence classes w.r.t T, this does not coincide necessarily with their 
order in the chain. Consider Q\ and Q2 let Q\ be the point in the equivalence class of Qi which the 
tour visits. By the chain condition we have either Q\ > ver Q2 or vice versa, in any case we conclude 
that t(Q' 2 ) > t{Qi) + (t(Q 2 ) - t(Qi)) + 1. More generally, assume that Q\ = T h (Qi). Applying the 
argument above to Q\ and T h (Q i+1 ) we conclude that t(Q' i+1 ) > t(Q[) + t(T h (Q i+1 )) - + 1 = 

t{Q'i) + (t(Q l+ i) - t{Qi)) + 1, which leads to t(Q' m ) - > (m - 1) + Y.1=i KQi+i) - t(Qi) = 

{m-l) + {t{Q m )-t{Qi)). Since Qi and Q 

m &re comparable w.r.t < V er we have t{Q m ) — t(Qi) > — 1/c 
which leads to the desired estimate, q.e.d 

3.2 The probabilistic setting 

We wish to provide average case analysis for the service time ST(R) as the number of requests, n, 
goes to infinity. We assume that we are given a probability distribution fi p = p(6,r)d9dr, where we 
will assume that p is smooth, on disk locations which reflects the popularity of the data stored in 
various disk locations. We assume that the n requested locations are sampled independently from the 
distribution fi p . We may also consider a dependent sequence of requests in which case p will denote 
the asymptotic distribution of requests, assuming one exists. We will be interested in asymptotic 
behavior of ST(R) as n tends to infinity. In this context the phrase "with high probability", w.h.p 
for short, refers to an event that occurs with probability approaching 1 on the probability space of 
all sets of n requests as n approaches infinity. 

Remark: We remark that in practice p will depend only on r and not on 9. This is due to the 
way data is laid out on a disk. Logically consecutive blocks of data are laid out in tracks for which 
r is fixed. To produce bias in the angular direction 9 an application would have to request blocks 
which are in arithmetic progressions with jump d equal to a multiple of the number of blocks in a 
disk track. Applications are usually unaware of the physical layout of data on disks, It is therefore 
very improbable that an application would randomly or even intentionally be able to produce bias 
in the 9 direction. On the other hand radius dependent bias occurs very frequently. Empty regions 
of the disk are of the form n < r < r 2 because of data layout, also, files occupy regions of this form 
resulting in bias due to differing file popularities. Nonetheless, it is instructive to work out the case 
of a general function p(9, r). 
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3.3 Increasing subsequences 

We say that a sequence of points z\ = (xi,yi), ...,Zk = (xk,yk) in the plane is increasing if Xi > Xj 
iff Di > Uj for all 1 < i,j < k. 

We can define a partial order >j nc on points of the plane by saying that (x±, y\) >i nc (x2, 2/2) iff 
x\ > X2 and y\>y2- An increasing subsequence is by definition a chain in this partially ordered set. 

Let fi q = q(x,y)dxdy be a distribution on a square with vertices (0, 0), (0, a), (a, 0), (a, a) with 
bounded density function q. The following theorem of Deuschel and Zeitouni computes the asymp- 
totics of the longest increasing subsequence among n points in the unit square sampled w.r.t [i q . 

Theorem 3.2 ([10]) 

Let S be a set of n points in the unit square chosen with respect to fi q . Denote by K the size of 
the largest increasing subset of S w.r.t <i nc - Then 

1) For all e > 0, w.h.p, \K — l rn ax\/n\ < £y/n. Here £ ma x is given by ( m ax = Max^i^), where 
i(4>) is the functional 



and 4> runs through all differ entiable nondecreasing functions on the unit interval with boundary 
conditions 0(0) = and <p(a) = a. 

2) For any e, 5 > 0, w.h.p, an increasing subset of size (K — e)y/n can be found in a 5 neighborhood 
of (f) if 4> maximizes the functional l{<f>). Here a 5 neighborhood refers to all points which are 
at a distance less then 5 from a point of the form (x, 4>{x)). 

3.4 Estimating M C:Ver (R) 

Assume that a set of n I/O requests, R = {R\, R n } are chosen w.r.t the location distribution 
given by p(9,r)d0dr. Our goal is to study the asymptotics of the random variable ST(R). By 
theorem 3.1 we may equivalently study the asymptotic behavior of the random variable Mc, V er(R)- 
We do so using theorem 3.2. We define the pull back pjj of the function p(9, r) to U by letting 
Pu(t,r)) = p(ir(t,r)) for (t,r) £ U. We recall that the seek function / is given by f(t) = ct. 

Theorem 3.3 1) w.h.p \ Mc, V er (R) — l(c,p)\' r n\ < Sy/n, where l(c,p) is given by Max^l(ip) and 




(1) 




(2) 



and the maximum is taken over all differentiable functions ip(r) with \vp'(r)\ < 1/c. 
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Proof: We prove the theorem first for the case of c = 1. Define the rectangles D it j i = 0,1,2,3, 
j = —4,..., 4 in U consisting of the points with coordinates i/4 < r < (i + l)/4 and t Mod 1 
with j/4 < t < (j/4) + 1. Given a chain M in C w.r.t < ver we can find a function g = gu 
g : {0, 1, 2, 3} — > {—4, ...4} and a lifting of M as constructed in the previous section which is a chain 
in D g = Uj Z)j 9M (j), i = 0, 1, 2, 3. This is the case since the 9 coordinate of elements in M can vary 
by at most 1/4 as r varies by 1/4 and any 9 interval of size 1/4 thereby missing at least one vertical 
line of the form j/4. We conclude that the length of the maximal chain is the maximum, over all 
functions g as above, of the maximal size chain in D g . We further note that each D g is a fundamental 
domain for the translation action T on U and hence, the probability distributions p(9,r)d9dr lifts 
to a distribution on D g . Fix a function g and a (piecewise) differentiable curve (r,<fr(r)) in D g with 
\<f>'\ < 1. We consider the size of a maximal chain in D g in an e neighborhood of the graph of (f>. It 
is easy to see that any chain will be in the e neighborhood of such a graph. 

Let S be the linear transformation which rotates the plane clockwise by 45 degrees. It is easy 
to verify that for points 1?2 in D g we have R\ < ver i?2 iff S(R\) <i nc S(R,2). We can now apply 
theorem 3.2 to S(D g ) with the induced distribution. Up to a translation, S(D g ) will be contained 
in a square as in theorem 3.2. We observe that curves (x,<f>(x)) with <p'(x) > are mapped by S 1-1 
to graphs of functions ip(r) which satisfy |V , '( r )| < 1- At the level of differentials we have along the 
graph of (f>, the formula dy = (p'(x)dx. Also dx = (dr + d9)/y/2 and similarly dy = (dr — d9)/y/2. 
We conclude that dxdy = 4>'{x)(dx) 2 = ((dr) 2 - (d9) 2 )/2 = ((dr) 2 (l - (^'(r)) 2 )/2. We see that the 
functional whose maximal value is l(c,p) coincides with the functional £((p) of theorem 3.2 as written 
in the 9, r coordinate frame via the mapping S . Sine© "^ver "^Inc 

are identified via T we obtain 

the theorem for < ver restricted to D g and since any chain of < ver in C can be lifted to one of the 
D g and the number of functions g is finite we obtain the theorem. 

For general values of c, the transformation V c (9,r) = (c8,r) transforms < c ,ver to <i tVer and we 
can apply the preceding arguments. The factor ^ comes from the effect of V c on the probability 
distribution. 

Let p = p(r) be a density which only depends on r. Let = vq < r\ < r<i < ... < r n _i < r n = 1 
and let dj i = 1, n be arbitrary non negative coefficients with 2~27=i d%( r % ~ r i-i) = 1- Let q = q(r) 
be a density function of the form J27=i c iXi r 1 r - where xi r 1 r 1S characteristic function of the 
interval [rj_i, ri\. We will assume that the set ro, r n contains the points 0, 1/4, 1/2, 3/4, 1. 

Since it is known that finite linear combinations of characteristic functions of intervals are dense 
in the space of Lebesgue measurable functions it is enough to prove the theorem for such densities. 

Consider the set Jj consisting of points (t, r) with rj_i < r < r^. Fix a function g as above 
and consider Jj >g = J« n D g . Since we have included the points k/4, k = 0, ...,4 in the set rj we 
have for any gi,g%,i a translation in U, T gu92 j : (t,r) — ► (t + t(g\, g2,i),r) which maps to 
Ji,g2- ^ e n °t e that translations preserve the functional in our theorem. The image of Jj i9 under S 
is a rectangle rotated by it/ 4. We can assume that the bottom vertex is the origin by shifting the 
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image. The vertices of the rotated rectangle are then of the form (0, 0), (—a, a), (b, b),(b — a,b + a) 
for some positive a, b. It is shown in [10] that given a constant density function on a convex set A 
and u = (xi,yi),w = (#2,2/2) G A, the straight line between u and w is the unique curve which 
maximizes the functional 1, subject to the initial conditions (p{x\) = y\ and <\>{x-i) = D2- Since q 
is constant on Jj j9 we see that the curve <p maximizing the functional on T(Jj i9 ) is a straight line 
between two boundary points u and w. The value of the functional £ max = £(<p) is then given by 
2\] q(x-i — xi)(t/2 — yi). We also notice that u must lie on the interval A = [(0,0), (—a, a)] and w on 
the parallel interval B = [(b, b), (b — a,b + a)]. Since both intervals have a slope of —1 we notice that 
in fact (x2 — x{) + (2/2 — Ui) is independent of u € A and w G B. The product is then maximized when 
X2 — x\ = 1/2 — yi in which case the line cp satisfies <p' = 1. Pulling back the result we see that any of 
the curves ip(r) = constant in Ji <g maximizes the functional among all curves in J,^ g and these are 
the only maximizing curves. Consider a vertical line of the form tp(r) = constant and an arbitrary 
non vertical curve ip. Let and be such that ip is contained in D g ^ and t/j is contained in D g ^y 
Let ipi = ip n J itgW and ^ = ip l~l The line T g ^y g ^^(ip) is a vertical line in J. ^ } and hence 

/(f/'i) = l(T g ^ g ^ j(V'i)) > K^)- Since V> is not vertical there exists an i$ for which the inequality is 
strict. Since l(ip) = J2i K^i) an d K^) = J2i K^i) we see that l(ip) > l(ip). Since this inequality holds 
for any density step function density q it holds for p(r). Computing / on an arbitrary vertical line 
yields the explicit formula, q.e.d 

As a consequence of the proof of theorem 3.3 we obtain the following corollary. 

Corollary 3.4 Let R C C be a set of N requests. The modified ABZ tour on R can be computed in 
time 0(n\og(n)). It provides a tour which is optimal within an additive constant. 

Proof: It is sufficient to establish the running time. By rescaling we can assume that c = 1. The 
main computational step in the modified ABZ algorithm is the computation of the peeling process 
decomposition W\, ...,W m for the set R u w.r.t < ver - When c = 1, points in I 2 can dominate or be 
dominated only by points in the set B C U satisfying — 1 < t < 1. Consequently, the restriction to 
I 2 of the peeling process applied to points in B coincides with the restriction to I 2 of the peeling 
process applied to all U. We have noted in the proof of theorem 3.3 that by a 45 degree rotation the 
relation < ver translates into the relation <i nc - It is therefore sufficient to apply the peeling process 
to the set of size 3iV, S(R U n B) w.r.t <j nc . It is well known that patience sorting, [1], provides an 
0{nlog(n)) algorithm for peeling a subset in the plane w.r.t <j nc . q.e.d. 

4 Lorentzian geometry and disk scheduling 

In this section we reformulate and extend the results of the previous section in terms of Lorentzian 
geometry. For the convenience of the reader we have added a very brief introduction to Lorentzian 
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geometry in the appendix. We will use some basic results on Lorentzian geometry in this section. 
These results can be found in [13], especially in chapter 7. We consider Lorentzian metrics on C 
the cylinder, U the infinite strip and I 2 the unit square. We recall the mapping ir : U — ► C, 
n(t,r) = (t Mod l,r). We consider the standard coordinates on them (9,r), (t,r) and (x,y). 

2 

Let p be a density function on either C or I 2 . We will define metrics by their associated forms 
ds 2 . Given p we define the metric gc,ver by ds 2 = |p(#,r)(<ir 2 — c 2 d9 2 ). We define the metric 
9u,ver on U by ds 2 = -p(ir(t, r))(dr 2 — c 2 dt 2 ). We define gu,hor = —gu,ver- On I 2 we consider the 
metric gp inc given by ds 2 = 4p(x, y)dxdy. For gc,ver and gu,ver we define the future pointing vector 
to be (0,1) globally. For gu,hor we define the future pointing vector to be (1,0) at all points and 
for gp inc we define the future pointing vector to be (1, 1) at all points. With these definitions 
it follows immediately that the partial orders we considered coincide with the past-future causal 
relation induced by the corresponding Lorentzian metrics. 

Remark: We can also define on C the metric gc,hor = —gc,ver- The metric gc,hor does not 
have an associated partial order (causal structure) since there are time-like closed curves, in fact any 
horizontal circle of the form r = constant is a closed time- like curve for this metric. 

Given a point z = (9, r) we let h(z) be the maximal length of a curve ending at z. A maximal 
length curve ending at z exists by corollary 7.7 of [13]. We will refer to h(z) as the height of the point 
z. Let C T denote the curve consisting of points z such that h(z) = r in the space-time (C,gc,ver)- 
The following result provides a reformulation and extension of theorem 3.3 in terms of Lorentzian 
geometry. 

Theorem 4.1 Let p(9,r) be a request probability distribution on a disk C. Let g be the Lorentzian 
metric induced by ds 2 = \p{9 ,r)(d9 2 — c 2 r 2 ). Let diam{C) denote the maximal length of a time-like 
curve in this model. Let R be a set of n requests chosen w.r.t to the density distribution p and let 
ST(R) be the number of disk rotations needed to service all the requests in R with an optimal policy, 
then: 



1) For all e > 0, w.h.p 

, ST(R) - diamiC) / _ 

I rj , I < £ V^ (3) 

2) When p = p(r) we have diam(C) = </| Jq 1 ^Jp(r)dr. 



2 Strictly speaking (8, r) is not a coordinate system on C due to the identification of (0, r) with (1, r). We can define 
coordinate patches on C similar to the subsets Df in the proof of theorem 3.3. The transition functions between the 
coordinates on different patches are given by translations (rotations in C) which do not affect the form of the metric 
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3) Let n(r) be the number of I/O which were serviced in the first T^/n disk rotations. For all 
e > we have w.h.p 

MT)-±fte(c h )dh. 



n 

where l(Ch) is the length of Ch w.r.t —g. 



<e (4) 



Proof: Given the definitions, parts 1 and 2 are simply a reformulation of theorem 3.3. We therefore 
need to prove part 3. Fix some angle 9. The function hg(r) = h((0,r)) is a strictly increasing 
function. Given some r we conclude that either there is a unique point of the form 9,g T (9) of height 
r, or h({9, r)) < r for all < r < 1. We define g T {9) to be 1 in the latter case and define C T to be 
the graph of g T . The curve Ch must be spacedike by definition and hence it is a causal curve for 
the metric —g, see [13] page 17 for a definition. It is also continuous at interior points of C. As a 
causal curve w.r.t —g it also has a finite length w.r.t —g according to [13] page 54. We also know 
that Ch is differentiable almost everywhere, [13] page 17. Let X(t) be the region bounded by Co and 
C T . By theorems 3.2, 3.3 and their proofs we know that for every S > 0, w.h.p, the I/O which were 
serviced during rotation T^fn lie in a 5 neighborhood of C T . Letting 5 tend to zero we see that for 

n(r)-(f p(8,r)d0dr)n 

each e > w.h.p 1 — e < < 1 + e. By equation (6) the Lorentzian area element is 

2p(9,r)d9dr hence J XT p(9,r)d9dr = \/2vol{X T ). It remains to show that vol(X T ) = Jq l(Ch)dh. For 
simplicity we shall presently assume that gh{9) < 1 for all 9, that is the curve Ch does not intersect 
the upper boundary curve r = 1. 

At this point we note that the d are the circles of radius h (from Co), X T is the ball of radius 
r and hence the desired formula is the classical formula of computing the volume (area) of a ball 
from the volumes (circumferences) of spheres (circles) , the proof of which follows from the fact that 
circles are perpendicular to (the geodetic) radial curves. 

We make this argument more precise. Let 21,2:2 £ C T , not necessarily different, and consider 
maximal curves 71,72 ending at z±,Z2 respectively. By proposition 7.8 of [13] 7$ are geodesies. The 
ji cannot meet at any point, assume to the contrary that they meet at a point y. The portions 
of 7i from Co to y must also be maximal and therefore of the same length. The 7$, being distinct 
geodesies must also have distinct tangent directions at y. Following 71 from Co to y and then 72 
from y to Z2 would yield a maximal curve ending in Z2 which is not differentiable at y and hence 
not a geodesic, see also [13] page 57 for this type of argument. By maximality, the 7$ (being radial 
curves) must also be perpendicular to all C\, h < r at points in which they are differentiable, see 
[13] theorem 7.27(b). Consider a point z £ Ch with more than one maximal curve. We shall call 
such a point a bad point. We denote by Z the set of all bad points. A point which is not bad 
will be called good. We may parameterize the past pointing geodesies ending at z £ Z by their 
tangent angle <f> at z. The set of angles 4> corresponding to maximal curves is closed since by upper 
semi continuity of the length function, [13] theorem 7.5, a limiting curve of maximal curves will also 
be maximal. We conclude that there are geodesic maximal curves 71,2,72,2 with endpoint z with 
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maximal and minimal tangent angles fa < fa. Given a bad z we may consider all points in C which 
lie in the curved triangle whose boundary curves are Co, 7i j2 and j2,z- denote this triangle by T z 
and let T = U ze zT z . Since maximal curves do not intersect, the points of T z , either do not lie on 
any maximal curve or, can only lie on maximal curves ending at z. On the other hand a point of C r , 
t < h, outside of T will lie on a unique maximal curve ending at a good point. To see this, assume 
that u £ C T is such a point and assume that no maximal curve passes through u. Let u\,U2 £ C T 
be the closest points to u in C T through which maximal curves pass. As argued before, such points 
exist since limiting curves of maximal curves are maximal. The maximal curves through ui,U2 will 
end at points z±, Z2- z\ / Z2 since otherwise u G T Zl by definition. Any maximal curve of any point 
in Ch between z\ and Z2 will meet C T in a point U3 in the interior of the interval of C T between 
u\ and U2 which contains u, resulting in a contradiction to the choice of U\,U2- For any bad z and 
r < h we consider the interval I TjZ = T z n C T . Since maximal curves do not intersect in C T , r < h 
we conclude that for any pair of bad points Z\ 7^ Z2 and any r < h I T>Zl and I TZ2 are disjoint. In 
particular the set of bad z is countable. Let B T = Ylzez K^r,z)- Since the intervals I TjZ are disjoint 
we have B T < £(C T ). Consider a point u £ I TZ . The distance between u and z is at most h — r 
since otherwise there would be a curve from Cq to z passing u of length greater than h. Let T dh 
be the set of point in T of height h — dh < r < h. We wish to show that vol(T dh ) = o{dh). For a 
given bad z we consider T z h = T z nT dh . We consider the Lorentzian metric at a small neighborhood 
of z. We assume that the neighborhood is chosen to be simply convex, [13] pages 5-6, so there are 
unique geodesies between any pair of points in the neighborhood, such neighborhoods always exist, 
see [13] page 5. We consider Riemann normal coordinates, see [13] pages 5-6 and [9] pages 112-113, 
on a small enough neighborhood of z. The metric in Riemann normal coordinates is constant up to 
second order terms, thus, we can compute lengths and areas up to a multiplicative error of 1 + dh 
assuming a constant metric. 

In these coordinates the geodesies emanating from z are given by straight lines. Also circles 
around z coincide with the circles in Minkowski space. The lines 71^ and 72,2 meet the curve Ch-dh at 
a distance of dh from z. T z h is contained in the sector U z , bounded by 7i )Z , 72,2 and the circle S z ^h of 
radius dh around z. Consider Vi, i = 1, 2, the meeting points of 7^2 and S z dh- I n the constant metric 
approximation The length of the geodesic between the Vi grows linearly in dh and can be written as 
c z dh, where c z depends on the angle 02,z — <fii,z an d the density p(z). We consider the circle section S z 
of radius dh between the Vi. Since the the geodesic is the maximal length curve between the meeting 
points we have c z dh > £(S Z ). By a simple calculation vol(U z ) = Jp =0 h£S ZjP dp < i{S z ^h)dh < c z dh 2 
in the constant metric approximation. We conclude that vol(T z ) < c z dh?(\ + dh). Notice however 
that replacing the intervals Ih-dh,z by the geodesies connecting their endpoints still leads to a causal 
curve, hence of finite length, /. Since ^ z c z dh + 0(dh 2 ) < I we conclude that J2 z °z is finite and 
vol(T d h) < {Ez c z)dh 2 {l + dh) = 0(dh 2 ) = o{dh). Let X dh = X h - X h _ dh . On X dh - T dh we 
can define a vector field by the tangent of the unique geodesic passing through a point. This vector 
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field is almost everywhere orthogonal to the level curves C T and in particular to C^. the area of 
X dh - T dh is then i(Ch)dh(\ + dh). Combining with the estimate for T dh proves the theorem. We 
now consider the case where the curve Ch does meet the upper boundary curve F given by r = 1. 
We define F^dh to consist of the points on z G F of height h — dh < h(z) < h. We can apply the 
preceding arguments with Ch replaced by the causal curve Ch,dh = Ch U (Fh : dh)- The curve Ch is the 
intersection of the monotone family of curves Ch,dh- Since the length of a portion of F is bounded 
up to a constant multiplicative factor by it's measure we see t{Ch,dh) converges to i{Ch) as dh tends 
to completing the proof, q.e.d 

Using essentially the same proof we obtain the following more general version which is stated 
entirely in Lorentzian geometric terms 

Corollary 4.2 Let C be a compact subset of a strongly causal Lorentzian manifold, which satisfies 
vol{C) = 2. Let < denote the causal partial order induced on C. Consider the density distribution 
p = vol/2 on C. For any pair of points x,y £ C with x < y let d(x,y) be the length of the maximal 
curve in C beginning at x and ending at y. Let L x , y ^ n denote the random variable which assigns to n 
points in C sampled according to p the length of the longest chain of the form x < z\ < ... < z^ < y. 
The following statements hold: 

1 ) for any pair of points x,y G C with x < y, w.h.p 1 — e < J^y-Q^ < 1 + £ 

2) Let n T denote the number of points in the first T^fn layers of the peeling process applied to n 
sampled points, then for all r > 0, 1 — e < t . r> ; „ ; < 1 + e. 

Example: Let us consider the second statement with C being the unit square and ds 2 = Adxdy. 
In this case the peeling process coincides with patience sorting, the layers corresponding to card piles, 
sec [1]. The curve Ch consists of the points satisfying 2^/xy = h or y = 4-. We use x to parametrize 
the curve and dy = hence the curve length is fl? ^dx = 2hln(j^). Part 2 then states that 

T 

asymptotically n T = JJ" hln(j^)dh thus reproving theorem 12 of [1] on pile sizes in patience sorting. 

Remark: An analogue of part (1) of the corollary for d dimensional flat Minkowski space is given 
in [7]. Combined with the techniques of [10] this can be extend to domains in arbitrary Lorentzian 
manifolds. We plan to pursue this line of reasoning in future work. 

5 Finer asymptotics for the uniform distribution 

In this section we consider finer asymptotics for ST(R) when the distribution p on C is uniform. 

Let X n be the random variable which counts the length of the longest increasing subsequence 
among n uniformly chosen points in the unit square. Let Y n be the corresponding "Poisonnized" 
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random variable which counts the length of the longest increasing subsequence for a Poisson process 
of intensity n in the unit square. We recall that an intensity n Poisson process will generate k 

k 

uniformly distributed points in the unit square with probability e — n^y and that the restriction of 
the points to disjoint subsets are independent processes. The statistics of Y n have been studied in 
great detail, see [5]. The following result of M. Lowe and F. Merkl, see [12] theorem 1.2, computes 
the moderate deviations of Y n and X n . 

Theorem 5.1 Let < t n < n s , for some 5 < 1/3 be a sequence such that t n — ► oo. Let l n = 



The same holds for X n . 

We also need the following result of K. Johansson. 

Theorem 5.2 ([11]), Let 5 > 0. Let Z n be the length of the longest increasing subsequence of an 
intensity n Poisson process restricted to the diagonal stripe given by \y - x\ < n 6 - 1 / 6 then Pr(Y n = 
Z n ) = l- e- nCW , where c{5) > 0. 

Using these two results we prove the following finer estimate on L n . 

Theorem 5.3 If f(9) = c6 and p is the uniform distribution then, for all e > 0, w.h.p 



with A = |(2c) -1 / 6 -e and B = 3 2 / 3 A + e. 

Proof: For simplicity we assume that c = 1. Consider the unit square I 2 C U with coordinates 
(9, r) and the partial order relation < ver restricted from U. Let e > 0. Consider the set I £ given by 
e < 9 < I — easa subset of either C or I 2 . When p is the uniform distribution or more generally 
of the form p = p{r) we know by the proof of theorem 3.3 that w.h.p the maximal chain in C 
will lie in a e neighborhood of a vertical line. Choose uniformly one of the lines say ip(r) = 9q 
for some 6q. Since no value of #o is more likely than any other value, we have with probability 
1 — Ae that 2e < 9q < 1 — 2e and thus the maximal chain is contained in I e . By definition any 
chain in I 2 w.r.t < ver is a chain in C w.r.t <c,ver- When e < 1/4 then the relations <c,ver and 
<u,ver coincide on any e neighborhood of a vertical line, to see this note that if P <c,ver Q then by 
definition there is a Q' such that T(Q) = T(Q') and r(Q') — r{P) > c(\t(Q') — t(p)\). By assumption 
\t(Q) - t(P)\ < 2e < 1/2 hence if Q' ± Q \t(Q') - t(P)\ > \t(Q') - t(Q)\ - \t(Q) - t(P)\ > 1 - 1/2 = 
1/2 > \t(Q) — t(P)\. We conclude that P <u,ver Q as desired. Taking the limit as e goes to We see 
that w.h.p Mc,ver(R) = Mp ver (R). As a consequence of theorem 3.1 we are reduced to finding finer 
estimates for M I 2 ver (R), where the requests are uniformly chosen. Rotating I 2 by 7r/4 and then 



t n n l l Q + then 



log(Pr(Y n > l n )) « -(4/3)^ 2 
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shifting and scaling the image we see that this is the same as finding M^ inc {R) where K is a diamond 
with vertices (0, — l/y/2), (l/\/2,0), (— l/y/2,0), (0, l/\/2) and R are uniformly chosen points in K. 
Let S n — ► be positive. Let xi = n i/ 6 -s^2 anc ^ cons ider the points Ai = (— Xj,— l/y/2 + Xj), 
i = l,...,^/ 6-5 ", which all lie on the bottom left boundary of K. Let B { = Ai + (\j \[2,\j \/2), 
all of whom lie on the top right boundary. We define Si to be the square with edges parallel to 
the axis and opposing vertices Ai and Bi. Let w = -^=n -1 / 6+ *" and define = Ai + (0,w), 
Di = Ai + (if, 0), Ei = Bi — (0,w) and Fi = Bi — (w,0). Let iJj be the hexagon whose vertices 
are Ai, Bi,C'i, Di, Ei, Fi. Hi is precisely the n -1 / 6+ * neighborhood of the diagonal in the square 
Si. Hi meets Hi + \ along the mutual edge whose endpoints are Ci = Dj+i and Fi = Ei + \. the 
interiors of the Hi are disjoint. The square Si has area 1/2. By scaling, an intensity n/2 Poisson 
process on Si is equivalent to an intensity n process on the unit square. By theorem 5.2 w.h.p the 
longest increasing subsequence in Hi coincides with that of Si for all i. We see that the longest 
increasing subsequence in K dominates the random variable Y n = Max(Y n / 2i i), i = 1, n 1 / 6-5 ™, 
where the Y n / 2: i are i.i.d with distribution Y n / 2 . Let e > 0. Since the 3^/2,? are independent 
Y > 2^nj2 + t n (n/2) 1 / & w.h.p if Pr(Y n/2 > V2n + t n (n/2) 1 /6) > n -i/6+ e> by theorem 5.1 the last 
inequality holds if t n = ^Zn 2 / 3 (n)(l — e') for all e' > 0. Letting e go to zero yields the lower bound. 

To obtain the upper bound we let Zi = nl / 3 + s i = 1, n 1 / 3 ^. We let Pj = (— Zi, —l/y/2 + Zj) 
and let Qi = Pi + (1/V2, l/\/2) Let « = (l/v/2)n- 1 /3+'5, T< = P< - (0,u) and ^ = Qi + (n,0). 
Let be the rectangle with sides parallel to the axis and whose opposite vertices are Tj and 
?7j. It is easy to see that K is contained in L = UijRij (in fact in the union of The Tj 

form the set of minimal points of the poset (L,inc), likewise Uj form the set of maximal points, 
consequently, every chain in L is contained in Rij for some pair i,j. The area of Ri^ is vol(Rij) = 
(l/V^ + n- 1 / 3 - 5 ) 2 = \(l + 2ra~ 1 / 3 - <5 + n~ 2 / 3 - 2<5 ). consider a pair z, j and let /c = \i-j\. the area of 
isvol{Ri :j ) = i(l + (fc + l)n- 1 / 3 - 5 )(l-(A;-l)n- 1 / 3 -' 5 ) = i(l + 2n~ 1 / 3 ~ 5 - (k 2 - l)n _2 / 3_2<5 ). Let 
rj be any real satisfying 5 < rj. We see that if k > 2n 1 ^ 6+v we have vol(Rij) < ^(1 — n _1 / 3+2 ^ _<5 )). 
Applying theorem 5.1 to ^(1 — n -1 / 3+2 ( T '~ 5 )) we see that if k > 2n 1 / 6+v the probability that the 
longest increasing subsequence in Rij has length greater than y/2n is at most e _n " for some v > 0. 
from our lower bound we conclude that with high probability the longest increasing subsequence in 
L is contained in some Rij with \i — j\ < 2n l ^ +ri . The number of pairs satisfying this relation is 
at most 4n 1 / 2+5 +'? We also note that vol(Rij) < vol(R iti ) = \{l + 2n~ 1 / 3 -< 5 + n" 2 / 3 - 2 " 5 ) and hence 
the distribution of the length of the longest increasing subsequence in Rij is dominated by that of 
Ri t i. Let m = vol(Ri : i)n. We conclude that the distribution of the length of the longest increasing 
sequence in L is dominated by Y = Max(Y mti ), i = 1, .., 4n 1 / 2+<5+T? . Using the union bound, applying 
theorem 5.1 as before and letting 5, rj tend to zero we see that w.h.p Pr{Y < v / 2^ + t„(n/2) 1 / 6 ) for 
t n = 3 2 / 3 ^n 2//3 (n)(l — e). Finally, our estimates can be depoissonized as in [12] or [5] section 6. q.e.d 

Acknowledgments: I would like to thank D. Berend, L. Sapir, S. Skiena and especially O. Zeitouni 
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Appendix: A very brief introduction to Lorentzian geometry 

We briefly consider some very basic material on Lorentzian geometry which is relevant to our 
application. Since Lorentzian geometry is intimately tied to relativity theory and in fact was invented 
to model it, we also give a brief account of the physical interpretation of various geometrical notions 
in the theory. For a concise but much more comprehensive mathematical treatment the reader is 
referred to [13] and for more on the physics to [9]. 

A Lorentzian metric on a domain D C R n with local coordinates xi,...,x n is given by a C°° 
mapping from D to 2-forms 

ds 2 = ^2,9ijdxidxj (5) 

i 3 

Were gij is a symmetric non singular n by n matrix with a single positive eigenvalue. The form acts 
like the square of a "distance" function between nearby points. A space-time is a manifold (with 
boundary) with a global Lorentzian metric. 

Example: Consider R n+1 , denote the first coordinate which we index as the O'th coordinate by 
t and the other coordinates by xi, ...,x n . We attach to all points in R n+1 the fixed diagonal matrix 
g with entries (1, —1, —1, —1). We have ds 2 = dt 2 — dx\ — ... — dx\. R n+1 equipped with this 
constant Lorentzian metric is known as flat Minkowski space. It was introduced by Minkowski to 
provide a mathematical (geometrical) model for special relativity theory and in particular the notion 
of space-time as opposed to space and time separately. 

Given the metric g, we say that a tangent vector v is time-like if vgv 1 > 0, null if vgv 1 = and 
space-like if vgv 1 < 0. We say that v and w are orthogonal if vgw f = 0. The metric g also induces a 
volume form which in local coordinates is given by the formula 

vol = yj \det(g)\dx\...dx n +i (6) 

When a curve <p(u) is piecewise differentiable the length of 4> which we denote by £((p) is given by 
the formula ^ $ (u)g(j)' {u) 1 du where <j)'(u) = (xi(u), x' n (u)) is the tangent of <fi at u. £((/)) can 
only be defined if $ {u)g<j)' (u) 1 > for all u. 

We see that length can only be defined for curves whose tangents are either time-like or light-like. 
The set of time-like vectors has two cone shaped components. When there are no closed time-like 
curves we can define a notion of past and future , causality, via a consistent choice at each point 
of one of the components, that is, a continuous vector field of non vanishing time-like vectors. The 
vectors in the chosen component will be called future pointing. We can then define a partial order 
on the points of the space-time by declaring that A < g B if there is a curve beginning in A and 
ending in B with future pointing (time-like) tangents. Physically, the partial order corresponds to 
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the notion of causality, past-future relation. We note that scaling the metric by a (point dependent) 
scalar function does not affect this partial order. Such changes to the metric are called conformal. 

Relativity theory models gravity using a Lorentzian metric on a manifold (the universe). The 
motion of a particle (small body) with positive mass is described by a curve with future pointing 
time-like tangents. Such a curve is known as a world line. The length of the curve is the time 
which passes when measured by a clock attached to the particle, this is known in physics as proper 
time. Massless particles travel along curves with null tangents. When no force other than gravity is 
exerted on the particle (free falling particle), the trajectory of the particle can be divided into small 
segments so that each segment maximizes proper time (curve length) among all paths with the same 
endpoints. Curves with this property are known as geodesies. Geodesies are always differentiable 
curves. In flat Minkowski space which models a world without gravity, such curves are straight lines. 
This is a restatement of Galileo's principle (the first law of mechanics) that a free particle will travel 
in a straight line at constant speed. 

In two dimensional space-time, if g is a Lorentzian metric than so is —g. The causal structures 
induced by g and —g are complementary in the sense that v is space-like w.r.t g iff it is time-like 
w.r.t —g. Consequently, we can use —g to measure the length of curves which are space-like w.r.t g. 
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